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Abstract. We consider an "unconstrained" random fc-XORSAT prob- 
lem, which is a uniformly random system of m linear non- homogeneous 
Boolean equations over n variables, each equation containing k > 3 vari- 
ables, and a "constrained" model where every variable appears in at least 
two equations. For k = 3, Dubois and Mandler proved that m/n — 1 is a 
sharp threshold for almost certain solvability of constrained fc-XORSAT, 
and analyzed the 2-core of a random 3-regular hypergraph to extend this 
result to the threshold value of m/n for unconstrained 3-XORSAT. In 
this paper we show that m/n — 1 remains a sharp threshold for solv- 
ability for every k > 3, and we use Molloy's analysis of the 2-core of 
a random fc-regular hypergraph to extend this result to the threshold 
value of m/n for unconstrained fc-XORSAT. 



1. Introduction 

An instance of fc-XORSAT is given by a set of linear equations modulo 2 in 
which each equation consists of k variables drawn from a set of n variables, 
and a right hand side, which is either or 1. Equivalently, it is a linear 
system Ax = b modulo 2 in which A is an m x n 0-1 matrix each of whose 
row sums is k, and b is an arbitrary 0-1 vector. 

Random instances of many problems of this sort undergo phase transitions 
around some critical ratio lim n/m _ i . 00 m/n = c* , meaning that for any c < c* , 
a sequence of random instances F n>m with m/n < c, the probability that 
F n>m is satisfiable (or possesses some similar property) approaches 1, while 
the probability approaches if m/n > c > c*. Friedgut [11] proved that a 
wide range of problems have such sharp thresholds, but with the possibility 
that the threshold c* = c*(n) does not tend to a constant. The relatively 
few cases in which c* is known to be a constant include 2-SAT, by Chvatal 
and Reed [3], Goerdt [12], and de la Vega [IT] , an extension to Max 2-SAT, 
by Coppersmith, Gamarnik, Hajiaghayi, and Sorkin [3], and the pure-literal 
threshold for a £>SAT formula, by Molloy |14j . 
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The most natural random model of the /c-XORSAT problem is the "un- 
constrained" model in which each of the m equations' k variables are drawn 
uniformly (without replacement) from the set of all n variables, and the 
right hand side values are uniformly or 1; equivalently a random system 
Ax = b is given by a matrix A € {0,l} mxn drawn uniformly at random from 
the set of all such matrices with each row sum equal to k, and b S {0, l} m 
chosen uniformly at random. 

The case k = 2 has been extensively studied. According to Kolchin [13] 
and Creignon and Daude [5], for c := 2m /n the random system has a solution 
with limiting probability p(2m/n) + o(l), where p(x) € (0,1) for x < 1, 
p(l— ) = 0, and p{x) = for x > 1. Daude and Ravelomanana [6], and Pittel 
and Yeum [15] analyzed the near-critical behavior of solvability probability 
for 2m I n = 1 + e, e = o(n~ 1 / 4 ). 

For k > 2, Kolchin [13j analyzed the expected number of "critical row 
sets" , those with even column sums; their presence is necessary and sufficient 
for the (Boolean) rank of A to be less than m. He determined the thresholds 
Cfc such that the expected number of critical sets goes to zero if lim m/n < 
and to infinity if lim m/n > c^; in particular, C3 = 0.8894... Thus for 
lim m/n < c^, A is of full rank with probability tending to 1, implying 
Ax = b is with high probability satisfiable, meaning that the solvability 
threshold c* k is at least c* k > On the other hand, the expected number 
of solutions of Ax = b is 2 n ~ m , whence P(^4x = b is solvable) goes to zero 
for lim m/n > 1, so c£ < 1. So Kolchin's approach narrowed the location 
of the solvability threshold to the interval [ck , 1] , but left open whether this 
threshold is actually q,.. (In fact, even the existence of a threshold remained 
unclear.) Even though the expected number of critical row sets goes to 
infinity for lim m/n > c^, it is still possible that the number of critical sets 
itself does not go to infinity in probability. 

Dubois and Mandler [9] (see also [10]) introduced a "constrained" random 
fc-XORSAT model, where b is still uniformly random, but A is uniformly 
random over the subset of matrices in which each column sum is at least 2. 
For k = 3 (3-XORSAT) they showed that its threshold is lim njm _ >00 m/n = 
1. They used this surprising result to determine the threshold C3 ~ 0.9179 £ 
(03, 1) for the unconstrained model, thus completely answering Kolchin's 
question for k = 3. Explicitly, they showed that 

z* 

W c 3 = 3(i_ e -**)2' 

where z* is the unique positive root of 

(2) (z-3)e z + 3 + 2z = 0. 

The key idea for translating between the unconstrained and constrained 
models is that the 2-core of the 3-uniform hypergraph underlying a uni- 
formly random unconstrained instance is a uniformly random constrained 
instance with a predictable density m/n, and the threshold value for the 
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unconstrained model is that for which the density of the core is 1. Dubois 
and Mandler derive a sharp asymptotic formula for the density of the core 
by analyzing a 2-core algorithm that successively eliminates any variable 
involved in no equations or just one, along with the associated equation if 
any. If the core is empty, then the initial system of equations Ax = b is 
satisfiable. With high probability, Ax = b remains solvable on the event 
"the core is nonempty, but its row density is below 1" . Dubois and Mandler 
concluded [9] with the suggestion that their methods could be extended to 
the general constrained fc-XORSAT, k > 3. However, their approach — the 
second moment method for the number of solutions — requires solving a 
hard maximization problem with 0(fc) variables, a genuinely daunting task. 

In this paper we prove that lim m/n = 1 remains the solvability threshold 
for the general constrained fc-XORSAT, for all k > 3. The crucial fact is 
that, for k > 3 and lim m/n < 1, the expected number of critical row sets 
approaches at a rate n~^ k ~ 2 \ What makes the asymptotic analysis doable 
is that in this, Kolchin-inspired, dual approach, the inevitable maximiza- 
tion problem is considerably simpler, involving just a bivariate function, no 
matter how large k is. We think that our argument provides a transparent 
explanation for why lim m/n = 1 stubbornly remains the threshold for the 
constrained fc-XORSAT for every k > 3. 

Of course, once the constrained threshold of 1 is established for all k > 3, 
the reduction idea of Dubois and Mandler can be used to determine the 
unconstrained thresholds ct. This time the issue is determination of the 
likely edge density for the 2-core of a random fc-uniform hypergraph. Molloy 
computed this in [TJ], and we use his results to show that, for k > 3, 

z * 

W C k = T7i -z*\h 1 ' 

k(l — e z k) k ^ i - 
where z^ is the unique positive root of 
(4) (z - k)e z + k + {k - l)z = 0. 

For k = 3, we are back to Dubois-Mandler's (P)-©. 

While working on this project, we became aware (thanks to Alan Frieze) 
of a very recent extended abstract by Dietzfelbinger, Goerdt, Mitzenmacher, 
Montanari, Pagh, and Rink [7] on "cuckoo hashing". It claims a proof of 
the constrained and unconstrained fc-XORSAT thresholds, but the proof, 
influenced by [9|, does not seem to be complete. 

The remainder of the paper is organized as follows. In Section [2] we in- 
dicate that for the two-moments method for the number of solutions to 
be applicable in principle, it is necessary and sufficient that the expected 
number of non-empty critical row sets approaches zero. Thus we have a 
surprising reduction to the first order moment treatment of a dual charac- 
teristic of A, i. e. Kolchin's approach. This is fortunate since the expected 
number of those row sets is quite amenable to asymptotic evaluation no 
matter how large fc is. 
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In Section [3] first we show that, for the constrained model, instead of 
considering random 0-1 matrices A, it is asymptotically equivalent to con- 
sider uniformly random nonnegative integer matrices A subject to the same 
constraints on row sums (= k) and column sums (> 2). Second, using gen- 
erating functions and ChernofF's method, we obtain an exponential bound 
for the expected number of critical sets of any given cardinality. Third, we 
use this bound to show that, for limm/ra strictly below 1, and k > 3, the 
expected number of all critical sets is of order n~( k ~ 2 ^ . Thus, there is un- 
likely to be any such set. Hence, with high probability, A is of full rank, 
and the system is solvable. On the other hand, since the expected number 
of solutions of Ax = b is 2 n_m , for lim m/n > 1 it tends to 0. So, with high 
probability, there are no solutions. We conclude that lim m/n = 1 is a sharp 
threshold for solvability of Ax = b in the constrained case for all k > 3. 

Finally in Section U] we give a derivation sketch for the unconstrained 
thresholds ct, k > 3, based on Molloy's results |14j . 

2. Proof background 

Let N be the number of solutions for a random formula, i.e., to the system 
of equations Ax = b for a random pair (A,b). By the first-moment method, 
the probability that a random formula is satisfiable satisfies 

¥(Ax = b is satisfiable) = F(N > 0) < E[N\. 

Let A be distributed arbitrarily over all m x n matrices, with 0, 1-entries, 
and let b be independent of A, and uniformly distributed over {0, l} m . As 
Dubois and Mandler [9] observed, regardless of the distribution of A, 

E[N) = T~ m . 

Since E[iV] — > if lim m/n > 1, this already establishes that, for any distri- 
bution of A, the satisfiability threshold c* satisfies c* < 1. 

To show that c* > 1 for k = 3 and A distributed uniformly over a con- 
strained set of matrices, with all column sums 2 at least, [9] used the second- 
moment method based on 

F(Ax = b is satisfiable) = F(N > 0) > E[N] 2 /E[N 2 }. 

The task was to show that E[iV] 2 /E[./V 2 ] — > 1 for lim m/n < 1. Associating 
iV 2 with the number of ordered pairs of solutions {x\, X2} to Ax = b, they ex- 
pressed E[iV 2 ] as the sum of the expected numbers of those pairs xi,X2 over 
all possible values of the numbers of common components of x\,X2 and their 
locations in the equations. Determination of dominant contributors to that 
sum and demonstration that E[iV 2 ] is indeed asymptotic to E[iV] 2 required 
an ingenious analysis of an attendant parametric maximization problem, 
with two variables and two parameters. Dubois and Mandler concluded 
with a remark that their method could be extended to /c-XORSAT for any 
k > 3. However, our attempt to do that failed: the number of variables 
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grows linearly with k, making the corresponding parametric maximization 
exceedingly problematic. 

To explain the approach we adopted in this paper, it is instructive first 
to derive an alternative representation for E[iV 2 ]/E[./V] 2 , under the assump- 
tion that A and b are independent and b is uniform. By elementary lin- 
ear algebra, for any fixed A, having rank r(A) and nullity n{A) over J-~2, 
Ax = b has 2 m ~ r ^ solutions for each of the 2 r ( A ^ admissible values of b 
(those in {Ax: x 6 {0,1}™}), and for the remaining values of b. Then, 
E[iV 2 ] = E[2 2n " r ( A )- m ], and 

E[iV 2 ]/E[iV] 2 = E[2 m ~ r(A) ] = E[2 n(AT) ]. 

Interpret 2 n ^ AT \ the number of solutions to yA = 0, as the number of critical 
row sets, that is, subsets S C [m] such that all the column sums X^ie5 ai >i' 
j G [n], are even; see Kolchin [I3j Section 3.5]. Thus, defining 

X = # of non-empty critical row subsets of A, 

we have E[iV 2 ]/E[iV] 2 = 1 + E[X] -> 1 iff E[X] ->■ 0, regardless of the 
distribution of A. In words, the second moment method for N, the number 
of solutions, works iff M[X] — > 0, where X is a dual parameter, the number 
of non-empty critical row sets of A. And it turns out that counting the 
expected number of critical sets is considerably easier than counting the 
expected number of pairs of solutions. 

While we will not need it, it is worth mentioning that, under M[X] — > 0, 
substantially more holds true. For if 

E[2 m - r ^-l] =E[X] -+0, 

then P(r(j4) = m) — > 1, as r(A) < m. (This can happen only if n > m, 
of course.) In such a case, with high probability, for every b whence for 
the uniformly random b, we have N = 2 n ~ m , not just N/2 n ~ m — > 1 in 
probability. 

We will show that indeed for k > 3 M[X] — > 0, if c := m/n satisfies 
lim mjn _ i . 00 c < 1, and A is uniformly distributed according to the con- 
strained model. The inevitable calculus problem turns out to be relatively 
tractable. To be more specific, we show that the expected number of crit- 
ical row sets of cardinality [i = an is bounded from above, roughly, by 
min z exp[niffc(a, z; c)] , where Hk is defined in (fl~8j) . and z = (zi,^) > 0. 
The task of finding the best z = z(a) and showing that H} t (a,z(a); c) < 
is hard, but we produce an explicit z(a) for which hk(a; c) := Hk(a, z(q); c) 
is proved to be negative. 

3. Counting critical row subsets 

3.1. Probability spaces. Remember that throughout Section [3] we work 
with the constrained model. Let A m ,n denote the set of all m x n matrices 
with 0-1 entries, such that all m row sums are k, and all n column sums are 
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at least 2. For A m ,n to be nonempty, it is necessary that km > 2n. We will 
assume that m , n — >■ oo in such a way that m = G (n) and 

k—-2j> 0. 

The (Boolean) rank of a matrix A £ ^4 m)n is strictly less than to iff A has a 
non-empty critical row set. 

A matrix A G *4 m! n may be interpreted as an outcome of the following 
allocation scheme. We have an m x n array of cells with k indistinguishable 
chips assigned to each of the to rows. For each row, the k chips are put in k 
distinct cells (so there is at most one chip per cell), subject to a constraint: 
every column gets at least two chips. Instead, we will consider the chips 
in every row to be distinguishable, so that each matrix A is obtained from 
(k\) m allocations (all that matters is whether a cell is occupied, not by which 
chip). Let B mi n be the set of all feasible allocations in this scheme. Clearly, 
the uniform distribution on A m ,n is translated into that on £> mj „. Let C mj „ 
be a relaxed version of B m ^ n : we do not require anymore that each of the mn 
cells gets at most one chip. Let B and C be distributed uniformly on B m n 
and C mj „, respectively. Crucially, and obviously, B is equal in distribution 
to C, conditioned on C € £> m , n - 

To state a key lemma on |*4. mi „|, |£> min |, and |C miTV |, we need some notation. 
Introduce 

f(x) = 2^— = e x -l-x. 
i>2 3 ' 

By ©, for m,n large enough (which we assume from now on), km/n > 2 
and is bounded away from 2. Any root of 

xf'(x) km 

(6) 77 v = = k c 

f(x) n 

is a stationary point of nln/(x) — kmlnx. Since ip(x) := xf'(x)/f(x) is 
strictly increasing (see Note 1), and ip(0+) = 2, ip(x) ~ x for x — > oo, 
dH) has a unique root, which is bounded away from zero, the point where 
nln/(x) — kmlnx attains its absolute maximum. Henceforth, let 

A = \(kc) 

be the unique root of ([6]) . 

Introduce a truncated Poisson random variable Z = Z(X), 

F(Z(\)=j) = ^jL, j>2. 

Observe that the probability generating function (p.g.f.) of Z(X) is given by 

[ J /(A) ' 



SATISFIABILITY THRESHOLD FOR fc-XORSAT 



7 



in particular, 



E[Z(A)] 



d f(z\) 



dz /(A) 



z=l 



sec 



and 



(7) Var[z(A)] = ^7(Ay + ^7(Ay 

,2 fll 



A/'(A) _ fcm 

7(aT 



d f(z\) 
dz /(A) 



z=l 



AV(A) , A/'(A) 



+ 



Af(A) 

/(A) ' /(A) L /(A) 
uniformly for A > 0. 

Note 1. That ip(x) strictly increases for x > follows from 

rj/( x ) = ^^ff = x- 1 Var[Z(x)] > 0, (x > 0). 
ax J\X) 



©(A)) 



Lemma 1. 

(8) 

(9) 



\Cm,n | 

IS I 

\ kJ m,n I 
Y-"m,n | 



1 +0(n- 



^27^raVar[Z(A)] 

it - 1 Ae A 

exp 



(km)\ 



/(A)" 
+ o(l), 



2 e A - 1 

so i/iat the fraction \B m ,n\/\Cm,n\ is bounded away from zero. Consequently 

B m ,n\ _ l + o(l) (fern)! 



|*4m,n | 



(fe!) m v / 2vrnVar[Z(A)] (£!) r 



(10) 



/(A) n 



■ exp 



fc - 1 Ae^ 



Note 2. For any U C S m , n , 

P(B G H) = P(C G ft | C G B m , n ) 

_ P(C G C G B n , m ) < P(C G 

|^m,n| / |Cn,m| |^m,n| / |Crx,m| 

Using this bound together with Q, we see that P(C G — > implies 
P(JB eH)->0. Replacing U by its complement gives that P(C G 7£) ->■ 1 
implies P(i? G T~L) — > 1. In words, to show that an event in the sample 
space B mtn is likely or unlikely, it suffices to prove that the event is likely or 
unlikely in the broader sample space C mjn , if both spaces are equipped with 
the uniform probability measure. 

Proof of Lemma [TJ, Equation (jlOp is immediate from ([8j) and © . 

We first prove (jSJ). To determine |C m>n |, recall that each row i G m is 
given its own A;, mutually distinguishable, chips. This means all km chips 
are distinguishable, and each chip is affiliated with a row. Pool all km chips 
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together, and allocate them among n columns, subject to the constraint that 
every column gets at least two chips. Cell gets those chips (if any) that 
are affiliated with row i and are allocated to column j. 

We can get such an allocation by first permuting all the chips, then allo- 
cating the first ji > 2 chips to column 1, the next j2 > 2 chips to column 2, 
etc., noting that the permutation within each group is irrelevant. Adopting 
the notational convention that for h(z) = Y^j hjZ 3 , we define 

[j]h{z) :=hj. 

We thus have 

\- {km)\ 

\<~m,n\ ~ 2^ ,■.!...,■! 

. . . . J 1 • Jn ■ 

JlH hjn = fe™. 

Jl >n>2 

= (km)\ [z km ] ( 4 J = i zkm ]f( z ) n 

(11) \3>2 j ') 

where Zi(A), . . . , Z n (X) are independent copies of Z(A). Since 

rt 

5^E[Z,-(A)] = nE[Z(A)] = /cm, 
i=i 

and limA > 0, by a local limit theorem (Aronson et al [T], pages 174-176), 
the last probability is 

1 + Q(n~ 1 ) 
v /27rnVar[Z(A)] ' 

which proves (j8jh 

We now prove ([9]). Let C = {cjj} be distributed uniformly on C mn . 
Let M denote the total number of cells that house 2 or more chips, i.e. 

M=\{{i,j) : a j >2}|. 

Let iV be the total number of pairs of chips hosted by the same cell, i.e., 

£ (?) = E( c r 

(i,i) :Ciij >2 V 7 (ij) V 

A?" = M iff there are no cells hosting more than 2 chips. Clearly 
^4=P(CG^ m , n )=P(iV = 0). 
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Of course, F(N = 0) = ¥(M = 0), but, unlike M, N is amenable to moment 
calculations. 

Denoting the indicator of an event E by 1(E), we write 
N= ]T Yl l{E(i,j;u,v)), 

i&[m],j£[n] l<u<v<k 

where E(i,j; u, v) is the event that, out of k chips owned by row i, at least 
the two chips u and v, were put into the (i, j) cell. Each of these mn(^) 
event indicators has the same expected value, 

(12) E[E(i,j;u,v)] = (km - 2)1 l - . 

To see this, note that once we have put two selected chips into an (i,j)-cell, 
we allocate the remaining (km — 2) chips among n columns, at least two per 
column, except allowing the jth column to receive an unconstrained number 
of additional chips (it already has two), whence the sole e x factor. Arguing 
as in (fTTI) . 



n-1 



(13) [z fcm - 2 ]/(*) n - V = l \l m ^ P \J2 Z M) + Po(A) = km - 2 J , 

where Po(A) stands for an independent, usual (not truncated) Poisson vari- 

able. The probability is again asymptotic to (27rnVar[Z(A)]) . Therefore, 
using © and km/n = A/' (A)// (A), 



mn 



(km) 2 /(A) 

where 

k - 1 Ae A 



(14) 7 
K J ' 2 e A - 1 

More generally, we claim that for every fixed t > 1 we have 

(15) E[(iV) t ]=7 t + (l), 

where, as usual, (a)& := a(a — 1) • • • (a — 6 + 1). 

Let us prove flE]). Denoting i = (ii,...,t t ), j = (j 1 ,...,j t ), u 
(ui, . . . ut), v = (vi, . . .,v t ), we have 

ie[m] 4 , i€[n]* \s=l / 

Hence 

ieM*,j'e[n]* \s=i / 
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We break the sum into two parts, Si and the remainder £2, where Si is the 
restriction to % and j, each having all its components distinct. In Si the 
total number of summands is (m)t(n)t(^) , and each summand is 

y~"m,n I 

see the explanation following (fT2j) . Analogously to (fT3j) . 



n—t 



[xkm -2t ]f{x) n-t ien t = f&ypk I P l £ Zi(A ) + ^Po s (A) = km - 2t 

\j=l 8=1 

where Po s (A) are independent copies of Po(A), that are also independent of 

Zi(A), . . . , Z n -t(X). As before, the probability is asymptotic to (27rraVar[Z(A)]) 
So, using (jHJ) and recalling (THj) . we have 

*~ (fcm)2t V/(A)y 



1/2 



(16) 



mnQ A 2 e A 



(fcm) 2 /(A) 



7*. 



In the case of S 2 , denoting / = . . . , J = {ji, . . . ,j t }, we have |/| + 
\J\ < 2t — 1. So the total number of attendant pairs (I, J) is at most 
(m + n) 2<_1 = O^n 24 " 1 ). And the number of pairs (i,j) inducing a given 
pair (/, J) is bounded above by a constant s(t). For every one of those s(t) 

choices, we select pairs of chips for each of the chosen t cells, in at most (2) 
ways overall. Lastly, we allocate the remaining (km — 2t) chips in such away 
that every column j £ [n] \ J gets at least 2 chips. As in the case of Si, this 
can be done in 



(km - 2t)\ [x km ~ 2t ] f(x) n -\ J \(e x ) lJ 
f(\)n-\J\( e \yj\ 



n-\J\ \J\ 

Zj(X) + ^2 Po s (A) = km - 2t 

3=1 s=l 



\ 3 = 1 s=± / 

1/2 



number of ways. Again, the probability is asymptotic to (2-7rnVar[Z(A)]) 
So, as e A > /(A), the sum S2 is of order 



Combining ()16p and (|17p . and recalling (|14p . we conclude: for each fixed 
t > 1, 

E[(N) t ]= 7 t + o(l). 
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Therefore N is asymptotic, with all its moments and in distribution, to 
Po(7). In particular, 

p(iV = 0) = P(Po(7) = 0) + o(l) = e" 7 + o(l). 

This completes the proof of Lemma [1] □ 



3.2. Main result. Armed with Lemma [TJ we will prove our central result. 



Theorem 2. Suppose A is chosen uniformly at random (uar) from the set 
A 

mn- Let X m ^ n denote the total number of non-empty critical row subsets 
of A. Suppose k > 3. If m = m(n) is such that bim^oo c < 1, then 
P(X m , n = 0) -4 1. 

Proof of Theorem O By Lemma [H the discussion preceding it, and Note 
2, we need only show that E[y" mjn ] — > 0, where Y mn is total number of non- 
empty critical row S6ts for th.6 uniformly rsnidom matrix C £ Cm,n' 

The 

proof rests on the following key claims. 

(£) 

Lemma 3. Let Ym 'n denote the total number of the critical row sets of 
cardinality £, 2 < i < m. Then, introducing z = (z\, z 2 ) > 0, 

(18) EKL] < b ^ exp[nH k (a, z; c)] , 
where 

(19) H k (a,z;c) 



-c(k-l)H(a) 



+ ck In A — cka In z\ — ck(l — a) In z 2 + In 



f{z\ +z 2 ) + f(z 2 - Zi) 



and 



H(a) := a In — \- (1 — a) In 

a 1 — a 



2/(A) 
, (entropy function). 



The reader certainly anticipates that, for a proper choice of z, the expo- 
nential factor in f 1 1 8 [) is dominant relative to (A/^) 1 ^ 2 - 

Lemma 4. Introduce hk(a;c) := inf 2 H k (a, z; c). Then, for k > 3 and 
c < 1, 



(20) 

More precisely, let a k ■- 



h k (a;c) < 0, Va G (0,1]. 



: then 



hk(a;c) < < 
where (3 = (3(c, k) > 0. 



ca In 

-P, 



<3 



fc/2 



fc/2— 1 



a G (0,0.999a fe ], 
a G [0.999a fc ,l], 
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We will prove Lemma 2] by using hk(a;c) < Hk(a,z;c) for a properly 
chosen z = z(a). For this choice of z we will have A/22 = O(l) uniformly 
for a. 

Proof of Lemma [3J, By symmetry, 
(21) E[yWj = Qp(P,); ^ := p|^^.iseven|. 

By symmetry again, 

p ( D «) = £("Wv). 

T^>i,v := n 1 e c * <j is even ' p° sitive r n n i e c ^ = ° r ■ 

j=l U=l J j"=i/+l U=l J 

Recalling that X^e[m] c *.i — ^> we see that on the event T>^ v 

even > 0, j < v, ( 0, j < 2/, 



. . f even > 0, j < v, f 

(23) £<* = { 5>^{ 



2, J>I/. 



Thus on £>^„ the column sums of the two complementary submatrices, 
{°i,j}i<e,je[n] an d i c i,j}i>e,je[n]j are subject to the independent constraints. 

Let C m ^ n {£,v) denote the set of all matrices C with row sums k, which 
meet the constraints (|23|) . Then ¥(T>i u ) is given by 

(24) p(l,v) ;= p(p i>y ) = J^g^H . 

By the independence of constraints on column sums for the upper and the 
lower submatrices of the matrices C in question, 

\Cm tn (£, v)\ = a(£, v) ■ b(m - £, v). 

Here, just like |C m) n|, (1) a(£,v) is the total number of ways to assign k£ 
chips among the first v columns so that each of those columns gets a positive 
even number of chips, and (2) b(m — £, v) is the total number of ways to 
assign k(m — I) chips among all n columns so that each of the last (n — u) 
columns gets at least 2 chips. 
As in (fTTJ) , 

{k£)\ 



a(£,u)= E 



j s >0, even 

V 



(k£)\ [z M ] [ e j ) = w [^](costz- ir, 



, J>0, even 
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and 

b(m-£,u)= ^ — — , 

=(k(m - £))\ [z k ^ m -\e z ) u f{z) n - y . 

Since the Taylor coefficients at z = of both (cosh z — l) u and e zv f(z) n ~ u 
are non-negative, we use these identities in a standard way to bound 

(25) 

( p Zl\V £( v \n — v 

b(m-£,v)<(k(m-£))\ [e ' , V z 2 > 0. 

Z 2 

The guiding intuition is that, for z\ and z 2 minimizing the RHS's of (|25p . 
the resulting bounds are the best possible, at least in terms of the ratio of 
corresponding logarithms. We also have 

(26) b{m-e,u) < b (nz 2 )- l l 2 {k(m-i))\ ^J^f^ , Vz 2 > 0. 

(We use A <b B to indicate that A = 0(B), uniformly over all arguments 
in the expressions A and B. Later we will use this notation if, for every 
fixed k, A = O(B), uniformly over all other arguments.) The bound (I26D is 
stronger than the second bound in (I25D if nz 2 is "very large". (1261) follows 
from the Cauchy integral formula 

z=Z2e . 
9e(-ir,Tr] 

| e z| _ e z 2 exp[— 22(1 — cos#)] , and less obvious 

\f(z)\ < |/(^ 2 )|exp[-z 2 (l-cos0)/3]. 

(See Pittel |16j for inequality, and [I] for how it works in combination with 
the Cauchy formula.) 

Using ([ID, dMI, first bound in ([25]), ([26]) and ([8]) from Lemma Q] together 
with ([7]), we obtain 

f\ fkmy 1 A fcm [e 22 (coshzi - l)] v f(z 2 ) n ~ v 

\l zi > 0, z 2 > 0. Now, given n, m, the best Z\, z 2 certainly depend on both I 
and v. Lowering our sights, we are content to use z\ and z 2 , whatever they 
will be, that depend on I only. The reason is that combining this bound 
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and ([22]) . (f2T1) . we obtain a bound 

«Msj^yi(7)(«r* 



(27) x ^p^l 7w J ' V21 ^ >0 ' 

that does not look forbiddingly complex. Observe that 

f(zi + z 2 ) + f(z 2 - Zl) 



/(^) + e 22 (coshz 1 -l) 

Introducing a = l/m, z = (zi,z 2 ), using the Stirling formula for factorials 
and exponentiating, we arrive at (fT8]) - (fT9|) . □ 



Proof of Lemma |4l First, as f(x) > f(—x) for x > 0, 

Ml; c) < (A, 0); c) = In ^Ij^"^ < 0, 

and it is bounded away from zero because A = X(kc) is. So, by continuity 
of h k {a; c) as a function of a, c, 

(28) sup h k (a; c) < 0, 

ae[i-6,i\ 

if (5 = <5(c, fc) > is small enough. So we now focus on a < 1 — <5i. Our task 
is to determine a proper z = z(a) for which H k (a, z; c) < for a £ (0, 1 — <5i] 
and c < 1. For guidance, let us assume that the infimum h k (a; c) is attained 
at some z*(a;c) = {z\(a; c), z 2 (a, c)). Such z*(a;c) is a solution of 

dH k (a,z;c) _ _ c/ca /'(zi +^2) - /'(z2 - zi) = Q 
, 29 ^ <9^i zi f(zi + z 2 ) + f(z 2 - Zl) 

1 ) dH k (a, z; c) _ cfc(l - a) | f(gi + z 2 ) + f(z 2 - Zl ) _ Q 

<9z 2 Z 2 /(Zl + 22) + /(z 2 - Zl) 

Consequently, 

(Z2 " + Z 2 ) + (Z 2 + Zl)/'(Z 2 - Zl ) 



ck(l - 2a) 



f(zi + z 2 ) + /(z 2 - zi) 



therefore zl(a;c) < z 2 (a,]c) for a < 1/2, as the RHS is non-positive other- 
wise. For a > 1/2, define 

zl*(a; c) = z 2 (l — a; c), z 2 *(a; c) = z*(l — a; c). 

By ([I9D, and f{x) < f(\x\), 

H k (a,z**(a;c);c) < H k (a,z*(l - a;c);c); 
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hence it suffices to consider a G (0, 1/2] only. Another easy corollary of (j29|) 
is 

f(zi + z 2 ) 



ck = ip(zi + z 2 ) 



f(zi + z 2 ) + f(z 2 - Zi) 

f(z2 - Zi) 



(30) + VO2 - zi) 

f(Zl + z 2 ) + f{z 2 - z\) 

ip{x) := xf'(x)/f(x). So ck = ip(X) is a mean of the values of ip{x) for 
x\ = z\ + z 2 and X2 = ^2 — z\. Since ip(x) is increasing, we obtain 

(31) z^a; c) — zl(a; c) < A < 2*(a; c) + ^2( a ; c )- 

It is also immediate from (|29|) and ^|(a; c) — ^(a; c) > that 

(32) < 



z$(a;c) 4(a;c)' 

It follows from ([3D]) and (J32]) that 4(1/2; c) = 4(1/2; c) = A/2. This 
observation suggests that, for a < 1/2 and not too far from 1/2, a good 
candidate for z(a) might be 

(33) zi(a;c) = Xa, z 2 (a; c) = X(l — a) . 

An additional reason for this choice is that the resulting H^a, z(a); c) given 
by ([19]) becomes 

/(A) + /(A(l-2a)) 



(34) H k (a,z(a; c); c) = cil(a) + In ■ 



2/(A) 



= ff fc (a; A) := ^(a) + In /(A) + ^~ 2a)) , 

a relatively compact expression. Further, a close inspection of (|29p shows 
that, for a small, z*(a\ c) and 4( a ) c ) are asymptotic to AG(a 1 / 2 ) and A(l — 
a) respectively. So let us define 

zi(a; c) = Aua 1 / 2 , z 2 (a; c) = A(l — a), 

with « = u(a;c) to be determined. Notice that z 2 = 0(A) for all a < 1/2. 
By analogy with z(a; c), for a G [1/2, 1 — <5i] we define 

Zl(a; c) = £2(1 — a; c), £2(0; c) = zi(l — a; c). 

So, for all a G (0, 1 — Si], z 2 (a; c) = 0(A), which makes the factor (A/^) 1 ^ 2 
in (fl~8|) of order 0(1). And, like z*(a;c), it suffices to consider a G (0, 1/2] 
only. 

Let us use H k {a, z(a; c); c) as an upper bound for /ifc(a; c). After cancel- 
lations, we have 

//c \ 1 1 

(35) Hk(a, z(a; c); c) = —c [ 1 I a In — h c(l — a) In kca Inu 

\ 2 J a 1 — a 

f \X(1 - a + ua 1 / 2 )! + / [A(l -a- ua 1 ' 2 )} 
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Now, In f(x) is convex, since 



m,A,r ...; r / '^ 



P{ x ) 



So, using V(A) = A/'(A)//(A) = fee, 



(36) 



f[X{l-a±ua 1 / 2 )] 



/(A) 

Combining this estimate with 



< exp 



Af(A) 
/(A) 



(-a±TO 1/2 ) 



In cosh x < — , (1 — a) In < a, 



we reduce (1331) to a bound 



' fe \ 1 

H k (a, z(a; c); c) < — c ( 1 J am — \- cot — kca Inu 

2 / 



kca 



(kcua 1 / 2 ) 2 



u = u(a; c), that minimizes this upper bound, is (fee) 1//2 , and we have 
(37) Hk(a, z(a; c); c) < ca In 



Now, the RHS of (ETJ) is negative for all c < 1 iff 



(38) 



a E (0,a k ), a k := 



k ' 



(a fc < 1/2 for all k > 3.) Thus 

(39) h k (a-c) < H k (a,z*(a;c);c) < 0, Va€(0,a fc ). 

For a 6 [0.999afc, 1/2], we use z\(a;c), Z2(a;c) defined in (f33l) . with the 
corresponding H k (a, z(a; c); c) given by (|3~4]) . 

Lemma 5. /ij VK 2 -) = x f'( x )/f( x ) ^ s concave. (2) Consequently, if a £ 
(0,1/2], A > are such that H k (a;X) > 0, then dH k (a;X)/dX > 0. In 
words, as a function of X, H k (a; X) is strictly increasing when it is non- 
negative. 

We will prove Lemma [5] at the end. The part (2) of this Lemma implies an 
important corollary. 

Corollary 6. For c < 1, 

(40) {a € (0, 1/2] : H k (a, z(a; c); c) > 0} 

C {a 6(0, 1/2] :# fe (a,z(a;l);l) > 0}. 

Equivalently, H k (a, z(a; c); c) is negative for all a 's such that H k (a; X k ) < 0. 
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Proof of Corollary [6l If for some c < 1 and a £ (0,1/2], we have 
Hk(a,z(a; c); c) > 0, then Hk(a;X(kc)) > 0, where ip(\(kc)) = kc. By 
Lemma [5j Hf-(ot;X) > for all A > X(kc), and, in particular, for Afc = 
A(fcc)| c= i, which means that Hk(a,z(a; 1); 1) > 0. □ 

As f'(x)/f(x) > 1, it follows that A& < k. By induction on k, A& > k — 1 
for all > 3. In addition, ip(Xk) = k is equivalent to 



A fc + 



4 



/(A fc ) 



so that 
(41) 



< i, 



which will come handy in a moment. Since 

1 



//"(a) 



a(l — a) 



<-4, 



we have 



H(a) - #(1/2) = if (a) - In 2 < --(1 - 2a) 2 . 



Also, as f(x) < e x x 2 /2 for x > 0, 



/(A(l-2 a ))<e A ( 1 - 2 ") A2(1 ~ 2a)2 , a<l/2. 



So 



H k (a;X k ) =H(a) - In 2 + In 



/(A fc (l-2a)) 
/(A fc ) 

A fc e A fc (l-2a) 



/(A fc 



<0, 



if 

(42) 



a > «fc := ^ 



dfc < 1/2 by (141) . and 5^ > as /(Afc) /A 2 , < e Afc . By log-convexity of /(#) 
and V'(Afc) = A;, 



/(A fc (l-2a)) 
/(Afc) 

So, for a < <5fc we have 



< exp 



f(A fc ) 
/(Afc) 



(-Afc2a) 



-2afc 



(43) 



H k (a;X k ) < F(a fc ) + In 



1 + e- 



-2ak 



To simplify this bound, let us show that a k decreases with k. We write 

'iu./(*)Y_ i 



In- 



;G(x), 
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where 

x 2 f(x) 

Now G(+0) = - In 2 < 0, and 



G'(x) = 1 
x 



xf'(x) fxf'(x)^ 



f(x) V f( x 



<0, 



since 

xf'{x) 



lim 



s->0 f(x) 

and xf'(x)/f(x) is concave. Therefore G{x) < for x > 0, so that 
x ln(f(x)/x ) increases with x. So, by (j4"2j) . indeed decreases with 
A;. So, considering k > 7, we replace (j4"3j) by a cruder bound 

1 + e -2 "^ 

(44) H k (a; X k ) < H(a 7 ) + In , Va<d 7 . 

Numerically, 

A 7 « 6.953, d 7 « 0.2794, ff(a 7 ) « 0.5924. 
By (BID, H k (a;\ k ) < if 

a£| /3 7 :=-ln( — ]» 1.122. 

\k 2J 2 \2e- H ^)-l) 

If 0.999«fc, (see {J38J) for a k ), exceeds /3?/k for k > 7, then — in the light of 
(1391) — we will be able conclude that 



(45) H k (a,z*(a;c);c) < 0, Va £ (0,1/2], c < 1. 

And, in fact, the inequality 

tj- < 0.999a fc = 0.999^-, 
does hold for k > 7, since 

inf<-*l- : fc > ?1 = ^ = 4r « 1.248 > 1.123 - ^ 7 



7 7 /5 72/5 • • 0.999' 

(Needless to say, 7 happens to be the first integer j > 3 for which 

^< 0.999a fc , Vfc>i.) 

The values = 6, 5, 4 can be dealt with by pushing the above device a bit 
further. See Appendix (a). Thus, contingent on Lemma [U we have proved 
the k > 3 part of Lemma [JJ 

Proof of Lemma O (1) It suffices to prove convexity of <p(x) = x 2 /f(x), 
since 

x f '(x) 

^P(x) ■= ~j^y = x + 
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Now 



4>"{x) 



2x x 2 f'{x)\ 



fix) f(x) 2 

2 4xf'{x) x 2 f"(x) + 2x 2 {f{x)) 2 



Here 



and 



f(x) f{x) 2 f(x) 2 f{xf 

_ a{x) 

g(x) = 2f(x) 2 - Axf'{x)f(x) - x 2 f"(x)f(x) + 2x 2 {f(x)) 2 
9j = [2 j ~ 2 (j " 8) + J 2 - j + 4] > 0, V j > 6. 



Therefore <j)"(x) > for all x > 0. 
(2) By the definition of Hk(ct; A) in ([3 

gg^A) i/w)y H(a) + 



I /(A) 



<9A fc V /(A) 7 v l + /( A ( 1 ~ 2 »)) 

ww/ i+ /(A) 

/(A(l-2a)) 

= -^'(A)iJ(a) + A" 1 • ty(A(l - 2a)) - . 

Here, by concavity of ^(-), 

^(A(l - 2a)) - tp(X) > -2a\ip'(\), 

and, by the condition Hk(a; A) > 0, 

. . k , 2/(A) k , 2 

#(a) > -ttt In -tttt vT = T7TT ln 



V(A) /(A) + /(A(l-2a)) V(A) 1 + /(A( ^ 2a)) ' 
So, since ip'(X) > 0, we have dHk(a; X)/d\ > 0, if 

/(A(l-2a)) 

i + — fTO i + — f(xj 

Recalling log-convexity of /(■)> we a ls° have 
/(A(l-2 o)) 
/(A) 

So (H5T) definitely holds if 



< 



exp [— 2aij){Xj\ . 



ln z— >0, Vz>0. 

l + e~ z l + e~ z 
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And indeed, using convexity of ln(-), we get 

2 1 - e~ z ze~ z 
In > > 



l + e~ z 2 l + e~ z 

the last inequality being equivalent to sinhz > z. This concludes the proof 
of Lemma [5l whence of Lemma [H □ 
The next claim follows immediately from Lemma fl9l where z 2 = 0(A), 
and Lemma [H 

Corollary 7. Let e = 0.999a*: • For every k > 3 and c < 1, there exists 
7o — 7o( c ) k) > such that, for 2 < (. < em, 

Consequently, 

[em] 

e=2 



Furthermore, for c < 1 

E E [ y S]=°( e " n/3 )' 

£>em 

p = P{c,k) > 0. 

We conclude that, for c < 1, 



m 



-(fc-2)> 



The k > 3 part of Theorem [5] is proved. □ 

The steady rise of technicality from k > 7 to k = 6, 5, 4, in that order, 
makes it clear that the case k = 3, dealt with differently in [9], would likely 
be more challenging for our approach. Here is a modification of our argument 
above necessary to cover k = 3 as well. This time we are more dependent 
on computer than for k > 4. Like k > 4, we need to show that, for c < 1, 
h^{a; c) < for a G (0, 1/2]. Beginning with arbitrary > 3, we define 

zi=zaV z 2 = z(l-a)V, 

z = Aexp[— /c _1 -ff(a)] ; 

in particular, z 2 = ®(A). Notice that letting A; f co we get the equations (f33|) . 
What makes this selection of controllable zi , z 2 promising is an observation 
that for the exponent defined in f)19|) we have 

(Aa\ IT I - \ Tit \ , 1 /(^l+^) + /(Z2-2i) 

(48) Hk[a,z;c) = cH(a) + In 



2/(A) 

Af(A) /(z 1 +z 2 ) + /(z 2 -z 1 ; 

iife(a; A) := if (aj + In ■ 



fc/(A) v ' 2/(A) 
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a formula nearly as compact as (|34h for z\ = aX, z<i = (1 — a)A. (Less 
formally, the optimal (zf(a;c), z|(a;c)) happens to be given by (l47|) at 
stationary points of hf c (a;c) = inf z Hk(a, z; c), which is considered as a 
function of a and of a continuously varying parameter k.) Moreover, in a 
crucial analogy to Hk(ot] A), 

(49) ^-H k (a;X)>0, if i? fc (a;A)>0. 

The proof of (|49p runs parallel to that of Lemma [5] but more involved, see 
Appendix (b). 

Consequently, for k = 3, we need only show that H 3 {a\ A3) < for all 
a € (0, 1/2). Now, it follows from (07]) and (08]) that 

H 3 (a;X 3 ) = -2a + 0(a 4/3 ), a 1 0; 

i7 3 (a;A 3 ) = - 7 (l/2-a) 2 + 0((l/2-a) 3 ), a f 1/2; 

Thus F 3 (0; A 3 ) = i? 3 (l/2; A 3 ) = 0, and H 3 (a; A 3 ) < for a / 0, 1/2 either 
close to or to 1/2. Using Maple we have checked that the univariate 
function H 3 (a; A3) is negative for all a G (0, 1/2). 

The proof of Theorem [2] is complete. □ 

4. Solvability threshold for the unconstrained fc-XORSAT 

As we mentioned in Introduction, Dubois and Mandler had determined 
the threshold for the unconstrained 3-XORSAT by analyzing the termi- 
nal state of a reduction process applied to an instance of this problem. 
At each step of this process a variable appearing in only one equation is 
deleted together with the row corresponding to that equation. In graph- 
theoretical terms, this is a process that delivers a 2-core of the underlying 
3-uniform hypergraph. So the solvability threshold C3 is the edge density 
lim m/n for the unconstrained instance for which the edge density of its 
2-core is 1. Dietzefelbinger, Goerdt, Mitzenmacher, Montanari, Pagh, and 
Rink [7J [8] demonstrated that the known results on 2-cores of the random k- 
uniform hypergraph could be used for analogous treatment of unconstrained 
A;-XORSAT in general, for k > 3, if the constrained threshold was known 
to be 1. Let us show how the rigorous analysis of the cores by Molloy |14] 
delivers a short derivation of tit, for all k > 3. 

Let 

c = k min- -, — T , k > 3. 

x>0 (1 - e -x)fc-l' 

From \14:\ Theorem 1] it follows that the terminal matrix obtained via the 
reduction process from the random m x n matrix A, i. e. the 2-core of the 
underlying hypergraph, is with high probability (whp) empty if lim m/n < c. 
Hence, for lim m/n < c, whp Ax = b is solvable. 
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If on the other hand c := \xmm/n > c, then whp the terminal matrix has 
M = (1 — e x )n + o(n) columns, where x = x(c) is the largest solution to 

(50) C=k ~ 1 T> ~" \h 1 ' 

Importantly, x(c) strictly increases with c. In this case a variable (column) 
chosen uniformly at random from among the Af columns of the terminal 
matrix appears in a random number of equations (rows) of the terminal 
matrix whose asymptotic distribution is a Poisson Z(x) conditioned on the 
event {Z(x) > 2}. (See shortly before ([7]) for a reminder.) Therefore, 
denoting by A4 the number of rows of the terminal matrix, we obtain 

kE[M] ~ E[N] ■ E[Z(x) | Z(x) > 2}. 

Since A4 and TV are sharply concentrated around E[.M] and E[7V] respec- 
tively, we obtain that, in probability, the terminal edge density is asymptotic 
to 

*-E[2(«) | Z(z) > 2] = t-^f/f-'? = *- ^ ~ '> 



¥(Z(x) > 2) e x - 1 - x 

We emphasize that, conditioned on the row column set having cardinalities 
A4,Af respectively, the terminal matrix is distributed uniformly on the set 
AmX' if we label rows and columns by the elements of [M] and [Af]. Let 
Xk be the unique positive root of 

AT 1 X x k{eX " - 1} = 1 <=> (x fc - fcje 8 * + A; + (fc - l)x fe = 0. 



It follows then from (|50p that cf, the threshold edge density of the uncon- 
strained matrix A is given by 

Cfc (l _ g-Xfe^fc-i ' 

The equations ©-([H) are proved. □ 
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Appendix (a), k = 6,5,4 cases. 

Let k = 6. Define xq = oq, and recursively 



(51) 




The recurrence ([5T]) is equivalent to 



1 + e 



-2-6xj 



H{ Xj -i)+\n 



2 



= 0. 
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H(x) is strictly increasing on [0, 1/2], and ln(l + e~ 12x ) is strictly decreasing. 
So if Xj < Xj-i for some j > 1 then 

H(x) + ln < 0, Vx 6 [xj,Xj-i]. 

Consequently, if xq > x\ > ■ ■ ■ > xj then 

(52) H 6 (a;\ 6 )<0, V a G [xj , x ] = [xj , a 6 ] , 

whence 

# 6 (a;A 6 )<0, Vae[ij,l/2). 

Numerically, 

x = a 6 w 0.302, xi « 0.207, x 2 « 0.134, 
i. e. xo > xi > X2 indeed. And we see that, by the definition of a.Q in (j38|h 

x 2 < 0.180 < 0.185 » 0.999a 6 = 0.999 



6 

66=2 



Combining (|52p and (|39p we obtain that 

# 6 (a,z(a;c);c) < 0, Va < 1/2, c < 1. 

Let = 5. Define {xj} by the recurrence obtained from (I5ip . with 6 
replaced by 5, and xq = a$. This time we get xo > x\ > xi > X3 w 0.177, 
while «5 w 0.186. So, as X3 < 0.999as, we have 

i?5(a,z(a;c);c) < 0, Va < 1/2, c < 1. 
Let k = 4. Here 

e 

xo = 04 ~ 0.374 > a 4 = — , 

16 

but xi « 0.429 > xo- In this case we introduce a sequence {yj}, such that 
y = 0.999a 4 « 0.170 and, for j > 1, 

/(A 4 ) + /(A 4 (1- 2^_ 1 )) 
2/(A 4 ) 

(cf. the definition of Hk(a; Afc) in (I34|) ). or equivalently 
/kq\ Tj-i [1 2 /(A 4 ) 

(53) m=H r /M+z^d-^-,)) . 

Here i/ _1 (x) is inverse to the entropy function, defined for x € [0, In 2]. The 
function 

, /(A 4 ) + /(A 4 (l-2y)) 
2/(A 4 ) 

is strictly decreasing. Therefore, as before, if yo < y\ < ■ ■ ■ < yj for some 
j > 0, then 

H k (a; A fc ) = ff(a) + ln /(A4) + *^~ 2a)) < 0, Va € [0.999a 4 , y,-]. 
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In this case we get yo < y\ < ■ ■ ■ < y-j sa 0.381 > 04. Therefore H^a; A4) < 
for all a G [0.999a 4 , 1/2), whence 

H 4 (a,i{a;c);c) < 0, VaG (0,1/2], c < 1. 

The reader may wonder why we did not use a cruder recurrence 



y.i 



H- 1 ( s + e-Wi )' (yo = 0.999a 4 ). 



The reason is that in this case lim J _). 00 j/j — >• y ~ 0.274, the root of 

F(y)+ln^ = °" 

That is, this sequence {yj} forever stays below 04 « 0.374! 

Appendix (b). Suppose a 6 (0, 1/2]. Let us show that dH^^a; X)/d\ > 
if Hk(a; A) > 0. Introduce 



u = u (a) := e - H ^' k [(1 - a) ¥ - a V " 
' [(1 — a) k + a k 



v = v(a) :=e- H ^/ k \ 



k-l - 



so that 



H k (a;X) = — — H(a)+ In — — — , [ip(x)~ 



k v/ 2/(A) ' V /(*) 

Obviously, u < 1 and it can be shown that v > 1. The condition Hk(ot\ A) > 
and log-convexity of /(x) imply that 

(55, M JM > h - VtA) 



Further 

,56) ^ A) = ^ Wo)+ » ta /C !! ^. 

Here 

{ f(uX)\' , { f(u\)\' 
d_ f(uX) + f(v\) _ {7W) X + {7W) X 
OX 2f(X) f(vXl f(vX}_ 

V 7 /(A) + /(A) 



and, using concavity of i/j(x), 
'f(uX)\' f(uX) 



/(A) J x A/(A) 



(nA)-V(A)] 



and likewise 
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So, using also log-convexity of f(x) and u — 1 < 0, 



(57) ^MjM. '-'^-V w 



2/(A) 



1 + 



> (v-l) + (u- 1) exp [>(t;A)(u - v)/v] ^ 

1 + exp [i/j(vX) (u — v )/ v\ 

(tt-l) + (tt-l)expty(A)fr-l)] 

l + exp^(A)(u-l)] ^ lAj ' 

we used v > max{-u, 1}. Combining ([55]) - ([57]) . we see that dHk(a; X)/d\ > 
if 

. 2 y + xe^ 

^(x, j/) := In + - > 0; 

e x + I + e x 



here 
By ([MD, 



s = ^(A)(u-l), y = ^(A)(u-l). 

y-x=2V'(A)e-^ (a)/A: Q^ i , 

x= V>(A)e-^/ fc [(l-a)^ 



a ^ _ e *(«)/*l 



< -^(A) e "^)/ fc a^, 



so that 

(58) 

Since 



x < 



y - x 



g(x,y) = In 



y-x 



l + e -{y-x) l + e -x' 
we see that, under condition ([58]) . g(x,y) > if 

0(*):=ln— — ^ >0, Vz>0. 
1 + e 2 1 + e 2 /^ 

This is indeed so, since 0(0) = 0, and 



b'( z ) =(l + e z )(l + e^ 2 )- 2 ^ 



i>3 



> 0. 
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